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Abstract. In this paper, we explore the use of the diffusion geometry 
framework for the fusion of geometric and photometric information in 
local and global shape descriptors. Our construction is based on the def- 
inition of a diffusion process on the shape manifold embedded into a 
high-dimensional space where the embedding coordinates represent the 
photometric information. Experimental results show that such data fu- 
sion is useful in coping with different challenges of shape analysis where 
pure geometric and pure photometric methods fail. 

1 Introduction 

In last decade, the amount of geometric data available in the public domain, 
such as Google 3D Warehouse, has grown dramatically and created the demand 
for shape search and retrieval algorithms capable of finding similar shapes in 
the same way a search engine responds to text queries. However, while text 
search methods are sufficiently developed to be ubiquitously used, the search 
and retrieval of 3D shapes remains a challenging problem. Shape retrieval based 
on text metadata, like annotations and tags added by the users, is often incapable 
of providing relevance level required for a reasonable user experience (see Figure 

Content-based shape retrieval using the shape itself as a query and based 
on the comparison of geometric and topological properties of shapes is com- 
plicated by the fact that many 3D objects manifest rich variability, and shape 
retrieval must often be invariant under different classes of transformations. A 
particularly challenging setting is the case of non-rigid shapes, including a wide 
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Fig. 1. The need for content-based shape retrieval: text-based search engine such as 
Google 3D Warehouse returns shapes of dogs as well as hot-dog cabins in response to 
the query "dog". The later is obviously irrelevant. 



range of transformations such as bending and articulated motion, rotation and 
translation, scaling, non-rigid deformation, and topological changes. The main 
challenge in shape retrieval algorithms is computing a shape descriptor, that 
would be unique for each shape, simple to compute and store, and invariant 
under different type of transformations. Shape similarity is determined by com- 
paring the shape descriptors. 

Prior works. Broadly, shape descriptors can be divided into global and 
local. The former consider global geometric or topological shape characteristics 
such as distance distributions [21124119] . geometric moments [14130] . or spectra 
[2"3] , whereas the latter describe the local behavior of the shape in a small patch. 
Popular examples of local descriptors include spin images [3J, shape contexts 
PP, integral volume descriptors [T5] and radius-normal histograms [52]. Using 
the bag of features paradigm common in image analysis |25|10j . a global shape 
descriptor counting the occurrence of local descriptors in some vocabulary can 
be computed [7]. 

Recently, there has been an increased interest in the use of diffusion geom- 
etry |11I16[ for constructing invariant shape descriptors. Diffusion geometry is 
closely related to heat propagation properties of shapes and allows obtaining 
global descriptors, such as distance distributions |24ll9l8j and Laplace-Beltrami 
spectral signatures [53], as well local descriptors such as heat kernel signatures 
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[26 9 . In particular, heat kernel signatures 26 showed very promising results in 
large-scale shape retrieval applications [7]. 

One limitation of these methods is that, so far, only geometric information 
has been considered. However, the abundance of textured models in computer 
graphics and modeling applications, as well as the advance in 3D shape acqui- 
sition [35 36 allowing to obtain textured 3D shapes of even moving objects, 
bring forth the need for descriptors also taking into consideration photometric 
information. Photometric information plays an important role in a variety of 
shape analysis applications, such as shape matching and correspondence |28l33j . 
Considering 2D views of the 3D shape |32l20j . standard feature detectors and 
descriptors used in image analysis such as SIFT [TS] can be employed. More 
recently, Zaharescu et al. [37] proposed a geometric SIFT-like descriptor for tex- 
tured shapes, defined directly on the surface. 

Main contribution. In this paper, we extend the diffusion geometry frame- 
work to include photometric information in addition to its geometric counterpart. 
This way, we incorporate important photometric properties on one hand, while 
exploiting a principled and theoretically established approach on the other. The 
main idea is to define a diffusion process that takes into consideration not only 
the geometry but also the texture of the shape. This is achieved by considering 
the shape as a manifold in a higher dimensional combined geometric-photometric 
embedding space, similarly to methods in image processing applications |15ll7j . 
As a result, we are able to construct local descriptors (heat kernel signatures) 
and global descriptors (diffusion distance distributions). The proposed data fu- 
sion can be useful in coping with different challenges of shape analysis where 
pure geometric and pure photometric methods fail. 

2 Background 

Throughout the paper, we assume the shape to be modeled as a two-dimensional 
compact Riemannian manifold X (possibly with a boundary) equipped with a 
metric tensor g. Fixing a system of local coordinates on X, the latter can be 
expressed as a 2 x 2 matrix g^ v , also known as the first fundamental form. The 
metric tensor allows to express the length of a vector v in the tangent space 
T X X at a point x as g^v^v^, where repeated indices [A, v = 1,2 are summed 
over following Einstein's convention. 

Given a smooth scalar field / : X — > E on the manifold, its gradient is defined 
as the vector field V/ satisfying f(x + dx) = f(x)+g x (Vf(x),dx) for every point 
x and every infinitesimal tangent vector dx G T X X. The metric tensor g defines 
the Laplace- Beltrami operator A g that satisfies 



for any pair of smooth scalar fields /, h : X — »• K; here da denotes integration 
with respect to the standard area measure on X. Such an integral definition is 
usually known as the Stokes identity. The Laplace-Beltrami operator is positive 




(1) 
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semi- definite and self-adjoint. Furthermore, it is an intrinsic property of X, i.e., 
it is expressible solely in terms of g. In the case when the metric g is Euclidean, 
A g becomes the standard Laplacian. 

The Laplace-Beltrami operator gives rise to the heat equation, 

(^ + |)« = 0, (2) 

which describes diffusion processes and heat propagation on the manifold. Here, 
u(x, t) denotes the distribution of heat at time t at point x. The initial condition 
to the equation is some heat distribution u(x,0), and if the manifold has a 
boundary, appropriate boundary conditions (e.g. Neumann or Dirichlet) must 
be specified. The solution of ([2| with a point initial heat distribution uq (x) = 
5 (x — x') is called the heat kernel and denoted here by ht(x, x'). Using a signal 
processing analogy, ht can be thought of as the "impulse response" of the heat 
equation. 

By the spectral decomposition theorem, the heat kernel can be represented 
as Q3] 

h t (x,x') = Y / e- x * t cf> i (x)<t>i(x), (3) 

i>0 

where = A < Ai < . . . are the eigenvalues and 4> ,4>i, ■ ■ ■ the corresponding 
eigenfunctions of the Laplace-Beltrami operator (i.e., solutions to A g $i = Xi<pi). 
The value of the heat kernel ht(x, x') can be interpreted as the transition proba- 
bility density of a random walk of length t from the point x to the point x' . This 
allows to construct a family of intrinsic metrics known as diffusion metrics^ 

df(x,x') = J {h t (x, ■) - h t (x', -)) 2 da 

^e-^OMaO-'M*')) 2 - (4) 

These metrics have an inherent multi-scale structure and measure the "connec- 
tivity rate" of the two points by paths of length t. We will collectively refer to 
quantities expressed in terms of the heat kernel or diffusion metrics as to dif- 
fusion geometry. Since the Laplace-Beltrami operator is intrinsic, the diffusion 
geometry it induces is invariant under isometric deformations of X (incongruent 
embeddings of g into R 3 ). 

3 Fusion of geometric and photometric data 

Let us further assume that the Ricmannian manifold X is a submanifold of some 
manifold £ (dim(£) = m > 2) with the Ricmannian metric tensor h, embedded 

1 Note that here the term metric is understood in the sense of metric geometry rather 
than the Riemannian inner product. To avoid confusion, we refer to the latter as to 
metric tensor throughout the paper. 
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by means of a diffeomorphism £ : X — > £. A Riemannian metric tensor on X 
induced by the embedding is the pullback metric (£*/i)(r, s) — h(d£(r) , <i£(s)) 
for r,s £ T X X, where <i£ : T X X — > T^mS is the differential of £. In coordinate 
notation, the pullback metric is expressed as (^h)^ = hijd^d^ 3 , where the 
indices i, j = 1, . . . ,m denote the embedding coordinates. 

Here, we use the structure of £ to model joint geometric and photometric in- 
formation. Such an approach has been successfully used in image processing [15 . 
When considering shapes as geometric object only, we define £ = K 3 and h to be 
the Euclidean metric. In this case, £ acts as a parametrization of X and the pull- 
back metric becomes simply = d fl £ 1 d u £ 1 + . . -+d^ 3 d v ^ 3 = (d^,d v ^) n a. 
In the case considered in this paper, the shape is endowed with photometric infor- 
mation given in the form of a field a : X — > C , where C denotes some colorspace 
(e.g., RGB or Lab). This photometric information can be modeled by defining 
£ = W 3 x C and an embedding £ = (£ ff ,£ p ). The embedding coordinates corre- 
sponding to geometric information £ ff = (£ , . . . , £ 3 ) are as previously and the 
embedding coordinate corresponding to photometric information are given by 
£ p = (£ 4 , . . . , £ 6 ) = ^(a 1 , . . . , a 3 ), where rj > is a scaling constant. Simplifying 
further, we assume C to have a Euclidean structure (for example, the Lab col- 
orspace has a natural Euclidean metric). The metric in this case boils down to 
(Ch)^ = (9^g,d u ^ g ) M a +if(d f j,£ p ,d l/ £p) M 3, which hereinafter we shall denote 
by g^v 

The Laplace-Beltrami operator A g associated with such a metric gives rise to 
diffusion geometry that combines photometric and geometric information (Fig- 
ure^. 

Invariance. It is important to mention that the joint metric tensor g and 
the diffusion geometry it induces have inherent ambiguities. Let us denote by 
Iso g = Iso((£*/i) M „) and Iso p = Iso((^*/i) /J!/ ) the respective groups of transfor- 
mation that leave the geometric and the photometric components of the shape 
unchanged. We will refer to such transformations as geometric and photomet- 
ric isometries. The diffusion metric induced by g is invariant the joint isometry 
group Isog = Iso((£*/i) M1/ +r7 2 (£*/i) M)y ). Ideally, we would like Iso g = lso g xIso p to 
hold. In practice, Iso^ is bigger: while every composition of a geometric isometry 
with a photometric isometry is a joint isometry, there exist some joint isome- 
tries which cannot be obtained as a composition of geometric and photometric 
isometries. An example of such transformations is uniform scaling of (£*/i) MJ/ 
combined with compensating scaling of {Cph)^ v - The ambiguity stems from the 
fact that IsOg is bigger compared to Iso 9 x Iso p . Experimental results show that 
no realistic geometric and photometric transformations lie in Iso^ \ (Iso g x Iso p ), 
however, a formal characterization of the isometry group is an important theo- 
retical question for future research. 

4 Numerical implementation 

Let {xi, . . . , xn} C X denote the discrete samples of the shape, and £(a;i), . . . , £(xn) 
be the corresponding embedding coordinates (three-dimensional in the case we 
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Fig. 2. Textured shape (left); values of the heat kernel (x placed on the foot, t — 
1024) arising from regular purely geometric (middle) and mixed photometric-geometric 
(right) diffusion process. 

consider only geometry, or six-dimensional in the case of geometry-photometry 
fusion). We further assume to be given a triangulation (simplicial complex), con- 
sisting of edges and faces (i,j,k) where each (i,j),(j,k), and (i,fc) is an 
edge (here i,j,k — l,... ,N). 

Discrete Laplacian. A function / on the discretized manifold is repre- 
sented as an A^-dimensional vector (f(xi), . . . , /(xn))- The discrete Laplace- 
Beltrami operator can be written in the generic form 

(Af)(xi) = - Vii(f(*i) - /C"i)), (5) 

where ife are weights, are normalization coefficients, and A/i denotes a lo- 
cal neighborhood of point i. Different discretizations of the Laplace-Beltrami 
operator can be cast into this form by appropriate definition of the above con- 
stants. For shapes represented as triangular meshes, a widely-used method is the 
cotangent scheme, which preserves many important properties of the continuous 
Laplace-Beltrami operator, such as positive semi-dcfinitcncss, symmetry, and 
locality [31]. Yet, in general, the cotangent scheme does not converge to the con- 
tinuous Laplace-Beltrami operator, in the sense that the solution of the discrete 
eigenproblem does not converge to the continuous one (pointwise convergence 
exists if the triangulation and sampling satisfy certain conditions |34j). 

Belkin et al. [5 proposed a discretization which is convergent without the 
restrictions on "good" triangulation required by the cotangent scheme. In this 
scheme, A/J is chosen to be the entire sampling {xi, . . . ,xn}, a« = 4^2, and 

w ij — Sje~^ x *^~^ x ^ / 4p , where p is a parameter. In the case of a Euclidean 
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colorspace, Wij can be written explicitly as 



Wij = Sj exp 



{ 




(6) 



where a = p/r/ 2 , which resembles the weights used in the bilateral filter [29] . Ex- 
perimental results also show that this operator produces accurate approximation 
of the Laplace-Beltrami operator under various conditions, such as noisy data 
input and different sampling |27l5j . 

Heat kernel computation. In matrix notation, equation ([5| can be writ- 
ten as Af = A~ 1 Wf, where A = diag(aj) and W — diag (X^i w uj ~ ( w ij)- 

The eigenvalue problem A$> = A<P is equivalent to the generalized symmetric 
eigenvalue problem W<P = AA<P, where A — diag(A , . . . , \ K ) is the diagonal 
matrix of the first K eigenvalues, and <S> = (</>o, . . . ,4>k) is the matrix of the 
eigenvectors stacked as columns. Since typically W is sparse, this problem can 
be efficiently solved numerically. 

Heat kernels can be approximated by taking the first largest eigenvalues and 
the corresponding eigenf unctions in (|3]). Since the coefficients in the expansion 
of ht decay as 0(e~ t ), typically a few eigenvalues {K in the range of 10 to 100) 
are required. 

5 Results and applications 

In this section, we show the application of the proposed framework to retrieval 
of textured shapes. We compare two approaches: bags of local features and dis- 
tributions of diffusion distances. 

5.1 Bags of local features 



ShapeGoogle framework. Sun et al. [26 proposed using the heat propagation 

properties as a local descriptor of the manifold. The diagonal of the heat kernel, 



referred to as the heat kernel signature (HKS), captures the local properties of 
X at point x and scale t. The descriptor is computed at each point as a vector 
of the values p(x) = (K tl (x,x), . . . , Kt n (x,x)), where t\, . . . ,t n are some time 
values. Such a descriptor is deformation-invariant, easy to compute, and provably 
informative [26j . 

Ovsjanikov et al. [7] employed the HKS local descriptor for large-scale shape 
retrieval using the bags of features paradigm |25) . In this approach, the shape 
is considered as a collection of "geometric words" from a fixed "vocabulary" 
and is described by the distribution of such words, also referred to as a bag of 
features or BoF. The vocabulary is constructed offline by clustering the HKS 





(7) 
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descriptor space. Then, for each point on the shape, the HKS is replaced by 
the nearest vocabulary word by means of vector quantization. Counting the 
frequency of each word, a BoF is constructed. The similarity of two shapes 
X and Y is then computed as the distance between the corresponding BoFs, 
d(X,Y) = \\BoF x -BoF y ||. 

Using the proposed approach, we define the color heat kernel signature (cHKS), 
defined in the same way as HKS with the standard Laplace-Belrami operator 
replaced by the one resulting from the geometric-photometric embedding. In 
the following, we show that such descriptors allow achieving superior retrieval 
performance. 

Evaluation methodology. In order to evaluate the proposed method, we 
used the SHREC 2010 robust large-scale shape retrieval benchmark methodol- 
ogy (§]. The query set consisted of 270 real- world human shapes from 5 classes 
acquired by a 3D scanner with real geometric transformations and simulated 
photometric transformations of different types and strengths, totalling in 54 
instances per shape (Figure [3]). Geometric transformations were divided into 
isometry+topology (real articulations and topological changes due to acquisition 
imperfections), and partiality (occlusions and addition of clutter such as the red 
ball in Figure [3]) . Photometric transformations included contrast (increase and 
decrease by scaling of the L channel), brightness (brighten and darken by shift 
of the L channel), hue (shift in the a channel), saturation (saturation and desat- 
uration by scaling of the a, b channels), and color noise (additive Gaussian noise 
in all channels) . Mixed transformations included isometry+topology transforma- 
tions in combination with two randomly selected photometric transformations. 
In each class, the transformation appeared in five different versions numbered 
1-5 corresponding to the transformation strength levels. One shape of each of 
the five classes was added to the queried corpus in addition to other 75 shapes 
used as clutter (Figure Q. 

Retrieval was performed by matching 270 transformed queries to the 75 
null shapes. Each query had exactly one correct corresponding null shape in 
the dataset. Performance was evaluated using the precision- recall character- 
istic. Precision P(r) is defined as the percentage of relevant shapes in the 
first r top-ranked retrieved shapes. Mean average precision (mAP), defined as 
mAP — P(r) ■ rel(r), where rel(r) is the relevance of a given rank, was used 
as a single measure of performance. Intuitively, mAP is interpreted as the area 
below the precision-recall curve. Ideal retrieval performance results in first rele- 
vant match with mAP=100%. Performance results were broken down according 
to transformation class and strength. 

Methods. In additional to the proposed approach, we compared purely 
geometric, purely photometric, and joint photometric-geometric descriptors. As 
a purely geometric descriptor, we used bags of features based on HKS accord- 
ing to [7J; purely photometric shape descriptor was a color histogram. As joint 
photometric-geometric descriptors, we used bags of features computed with the 
MeshHOG [37] and the proposed color HKS (cHKS). 
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Fig. 3. Examples of geometric and photometric shape transformations used as queries 
(shown at strength 5). First row, left to right: null, isometry+topology, partiality, two 
brightness transformations (brighten and darken), two contrast transformations (in- 
crease and decrease contrast). Second row, left to right: two saturation transformations 
(saturate and desaturate), hue, color noise, mixed. 



For the computation of the bag of features descriptors, we used the Shape 
Google framework with most of the settings as proposed in [7] . More specifically, 
HKS were computed at six scales (t = 1024,1351.2,1782.9,2352.5, and 4096). 
Soft vector quantization was applied with variance taken as twice the median of 
all distances between cluster centers. Approximate nearest neighbor method [2] 
was used for vector quantization. The Laplace-Beltrami operator discretization 
was computed using the Mesh-Laplace scheme [3] with scale parameter p = 2. 
Heat kernels were approximated using the first 200 eigenpairs of the discrete 
Laplacian. The MeshHOG descriptor was computed at prominent feature points 
(typically 100-2000 per shape), detected using the MeshDOG detector [37]. The 
vocabulary size in all the cases was set to 48. 

In cHKS, in order to avoid the choice of an arbitrary value 77, we used a 
set of three different weights (77 = 0,0.05,0.1) to compute the cHKS and the 
corresponding BoFs. The distance between two shapes was computed as the 
sum of the distances between the corresponding BoFs for each 77, weighted by 77, 
and 1 in case of 77 = 0, d(X, Y) = \\BoF° x - BoF^||f + £ n J7||BoF^ - BoF^||f . 

Results. Tables [l}|4] summarize the results of our experiments. Geometry 
only descriptor (HKS) [7 is invariant to photometric transformations, but is 
somewhat sensitive to topological noise and missing parts (Table [l]) . On the 
other hand, the color-only descriptor works well only for geometric transforma- 
tions that do not change the shape color. Photometric transformations, however, 
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Fig. 4. Null shapes in the dataset (shown at arbitrary scale for visualization purposes). 

make such a descriptor almost useless (Table pj). MeshHOG is almost invariant 
to photometric transformations being based on texture gradients, but is sensitive 
to color noise (Table [3|. The fusion of the geometric and photometric data using 
our approach (Tablel4| achieves nearly perfect retrieval for mixed and photo- 
metric transformations and outperforms other approaches. Figure [5] visualizes 
a few examples of the retrieved shapes ordered by relevance, which is inversely 
proportional to the distance from the query shape. 



5.2 Shape distributions 

Spectral shape distances. Recent works |24|19j showed that global shape 
descriptors can be constructed considering distributions of intrinsic distances. 
Given some intrinsic distance metric dx , its cumulative distribution is computed 
as 



(8) 



Fx (8) = / Xd x (x,x>)<& da(x)da(x'), 



where \ denotes an indicator function. Given two shapes X and Y with the 
corresponding distance metrics dx , dy , the similarity (referred to as spectral 
distance) is computed as a distance between the corresponding distributions Fx 
and Fy. 

Using the proposed framework, we construct diffusion distances according 
to Q, where the standard Laplace-Beltrami operator is again replaced by the one 
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Strength 

Transform. I <2 <3 <4 <5 
Isom+Topo 100.00 100.00 96.67 95.00 90.00 
Partial 66.67 60.42 63.89 63.28 63.63 

Contrast 100.00 100.00 100.00 100.00 100.00 
Brightness 100.00 100.00 100.00 100.00 100.00 
Hue 100.00 100.00 100.00 100.00 100.00 

Saturation 100.00 100.00 100.00 100.00 100.00 

Noise 100.00 100.00 100.00 100.00 100.00 

Mixed 90.00 95.00 93.33 95.00 96.00 

Table 1. Performance (mAP in %) of ShapeGoogle using BoFs with HKS descriptors. 
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25.99 
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Table 2. Performance (mAP in %) of color histograms. 



Strength 

Transform. I <2 <3 <4 <5~~ 
Isom+Topo 100.00 95.00 96.67 94.17 95.33 
Partial 75.00 61.15 69.93 68.28 68.79 

Contrast 100.00 100.00 100.00 98.33 94.17 
Brightness 100.00 100.00 100.00 100.00 99.00 
Hue 100.00 100.00 100.00 100.00 100.00 

Saturation 100.00 100.00 100.00 98.75 99.00 

Noise 100.00 100.00 88.89 83.33 78.33 

Mixed 100.00 100.00 100.00 93.33 83.40 

Table 3. Performance (mAP in %) of BoFs using MeshHOG descriptors. 



associated with the geometric-photometric embedding. Such distances account 
for photometric information, and, as we show in the following, show superior 
performance. 

Methods. Using the same benchmark as above, we compared shape retrieval 
approaches that use distance distribution as shape descriptors. Two methods 
were compared: pure geometric and joint geometric-photometric distances. In 
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Strength 



Transform. 1 <2 <3 <4 <5 

Isom+Topo 100.00 100.00 96.67 97.50 94.00 

Partial 68.75 68.13 69.03 67.40 67.13 

Contrast 100.00 100.00 100.00 100.00 100.00 

Brightness 100.00 100.00 100.00 100.00 100.00 

Hue 100.00 100.00 100.00 100.00 100.00 

Saturation 100.00 100.00 100.00 100.00 100.00 

Noise 100.00 100.00 100.00 100.00 100.00 

Mixed 100.00 100.00 96.67 97.50 98.00 

Table 4. Performance (mAP in %) of ShapeGoogle using w-multi-scale BoFs with 

cHKS descriptors. 



Strength 



Transform. 1 <2 <3 <4 <5 

Isom+Topo 80.00 90.00 88.89 86.67 89.33 
Partial 56.25 65.62 61.61 58.71 61.13 

Contrast 100.00 100.00 100.00 100.00 100.00 
Brightness 100.00 100.00 100.00 100.00 100.00 
Hue 100.00 100.00 100.00 100.00 100.00 

Saturation 100.00 100.00 100.00 100.00 100.00 
Noise 100.00 100.00 100.00 100.00 100.00 

Mixed 66.67 73.33 78.89 81.67 81.33 

Table 5. Performance (mAP in %) of pure geometric spectral shape distance J9J. 



the former, we used average of diffusion distances 

d(x,x') = r=- ^2dt(x,x'), (9) 
' ' teT 

computed at two scales, T = {1024, 4096}. In the latter, the distances were also 
computed at multiple scales r\ of the photometric component, 

d(x,x') = ~J2Il d t,v(x>x')' (10) 

' ' t<£Tr)£H 

The values H = {0,0.1,0.2} were used. For the computation of distributions, 
the shapes were subsampled at 2500 points using the farthest point sampling 
algorithm. 

Results. Tables [5]-|6] summarize the results. Both descriptors appear insen- 
sitive to photometric transformations. The joint distance has superior perfor- 
mance in pure geometric and mixed transformations. We conclude that the use 
of non-zero weight for the color component adds discriminativity to the distance 
distribution descriptor, while being still robust under photometric transforma- 
tions. 
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1 BHi ! it! 

HKS BoF [7] Color histogram cHKS multiscale BoF 



Fig. 5. Retrieval results using different methods. First column: query shapes, second 
column: first three matches obtained with HKS-based BoF 171 , third column: first three 
matches obtained using color histograms, fourth column: first three matches obtained 
with the proposed method (cHKS-based multiscale BoF). Shape annotation follows the 
convention shapeid. transformation, strength; numbers below show distance from query. 
Only a single correct match exists in the database (marked in green), and ideally, it 
should be the first one. 



6 Conclusions 



In this paper, we explored a way to fuse geometric and photometric information 
in the construction of shape descriptors. Our approach is based on heat propa- 
gation on a manifold embedded into a combined geometry-color space. Such dif- 
fusion processes capture both geometric and photometric information and give 
rise to local and global diffusion geometry (heat kernels and diffusion distances), 
which can be used as informative shape descriptors. We showed experimentally 
that the proposed descriptors outperform other geometry-only and photometry- 
only descriptors, as well as state-of-the-art joint geometric-photometric descrip- 
tors. In the future, it would be important to formally characterize the isometry 
group induced by the joint metric in order to understand the invariant proper- 
ties of the proposed diffusion geometry, and possibly design application-specific 
invariant descriptors. 
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